# Video Instruction Understanding
Smolvlm2 2.2B Instruct GGUF
Apache-2.0
SmolVLM2-2.2B-Instruct is a 2.2B-parameter vision-language model for video-text-to-text tasks. It supports English.
English
mradermacher
235
0
Smolvlm2 256M Video Instruct Mlx
Apache-2.0
A video-text-to-text model converted to the MLX format, suitable for video understanding and instruction-following tasks.
Image-to-Text
Transformers English

mlx-community
591
7
Smolvlm2 500M Video Instruct Mlx
Apache-2.0
A video-text-to-text model developed by HuggingFaceTB and provided in the MLX format. It supports English.
Image-to-Text
Transformers English

mlx-community
2,491
12
Featured Recommended AI Models